SUBSTITUTE SPECIFICATION 

DESCRIPTION 

METHOD FOR ENCODING MOVING IMAGE AND METHOD FOR 
DECODING MOVING IMAGE 

Technical Field

This invention relates to a moving image encoding method for
encoding a moving image signal after dividing the moving image
signal composed of luminance components and chrominance
components into blocks, and to a moving image decoding method for
decoding the encoded data.

Background Art 

In the age of multimedia, which integrally handles audio, video
and other pixel values, existing information media, specifically
newspapers, magazines, television, radio, telephone and the like,
through which information is conveyed to people, have recently
come to be included in the scope of multimedia. Generally,
multimedia refers to something that is represented by associating
not only characters, but also graphics, sound, and especially
images, together. In order to include the aforementioned existing
information media in the scope of multimedia, it is a prerequisite
to represent such information in digital form.

However, when the amount of information carried by each of the
mentioned information media is estimated as an amount of digital
information, one character of text requires only 1 to 2 bytes,
whereas sound requires 64 Kbits per second (telephone quality) and
moving images require 100 Mbits or more per second (current
television receiving quality). It is therefore not realistic for
the information media to handle such an enormous amount of
information as it is in digital form. For example, although video
phones are already in actual use via the Integrated Services
Digital Network (ISDN), which offers a transmission speed of
64 kbps to 1.5 Mbps, it is impossible to transmit television
images and images taken by cameras directly through ISDN.

Accordingly, information compression techniques have become
necessary. For example, in the case of the video phone, the H.261
and H.263 standards for moving image compression technology,
internationally standardized by the International
Telecommunication Union - Telecommunication Standardization
Sector (ITU-T), are being employed. Moreover, with the
information compression techniques of the MPEG-1 standard, it has
also become possible to store video information on general music
compact discs (CD) together with audio information.

Here, Moving Picture Experts Group (MPEG) is an international
standard for digital compression of moving image signals. MPEG-1
is a standard for compressing moving image signals to 1.5 Mbps,
in other words, compressing television signals to approximately
a hundredth of the original size. Moreover, since the target
picture quality within the scope of the MPEG-1 standard is limited
to a medium degree of quality which can be realized at a
transmission speed of primarily about 1.5 Mbps, MPEG-2, which was
standardized to satisfy demands for further improved picture
quality, realizes television broadcasting quality with moving
image signals compressed to 2 to 15 Mbps.

Furthermore, MPEG-4, which has a higher compression ratio, has
been standardized by the working group (ISO/IEC JTC1/SC29/WG11)
which has pursued the standardization of MPEG-1 and MPEG-2.
MPEG-4 not only enables efficient coding at a low bit rate, but
also introduces a strong error tolerance technique which can
reduce subjective picture quality degradation even when a
transmission line error has occurred. In addition, the
standardization of H.264 as a next-generation picture coding
method is currently being pursued jointly by ISO/IEC and ITU-T.

In general, in encoding of a moving image, the amount of
information is compressed by reducing redundancy in the temporal
and spatial directions. Here, in inter prediction encoding for
reducing temporal redundancy, motion estimation and generation of
a prediction image are performed on a block-by-block basis by
referring to preceding and following pictures, and encoding is
performed on a difference between the obtained prediction image
and a block to be encoded. Further, in intra prediction encoding
for reducing spatial redundancy, a prediction image is generated
from pixel information of a neighboring encoded block, and
encoding is performed on a difference between the obtained
prediction image and the block to be encoded.

Here, a picture is a term indicating one screen. It indicates
one frame when coded in a frame structure, and one field when
coded in a field structure.

Each picture is divided into blocks called macro blocks, each of
which is, for example, horizontal 16 x vertical 16 pixels, and is
processed on a block-by-block basis. A picture of the field
structure is encoded with all of its macro blocks treated as field
macro blocks. On the other hand, a picture of the frame structure
can be encoded not only with all of its macro blocks treated as
frame macro blocks, but also by switching between the frame
structure and the field structure in units of two vertically
adjacent macro blocks (a macro block pair).
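
The division described above can be illustrated with a minimal
sketch. The function names and the assumption that the picture
dimensions are exact multiples of the block size are hypothetical
and are not taken from this specification.

```python
def macro_blocks(width, height, size=16):
    """Yield the top-left (x, y) corner of each macro block in raster order."""
    for y in range(0, height, size):
        for x in range(0, width, size):
            yield (x, y)


def macro_block_pairs(width, height, size=16):
    """Yield (upper, lower) corners of vertically adjacent macro blocks."""
    # Assumes height is a multiple of 2 * size so every block has a partner.
    for y in range(0, height, 2 * size):
        for x in range(0, width, size):
            yield ((x, y), (x, y + size))


blocks = list(macro_blocks(64, 32))       # 4 columns x 2 rows
pairs = list(macro_block_pairs(64, 32))   # 4 pairs, one per column
```

In a frame-structure picture, the frame or field decision would
then be made once per element of `pairs` rather than once per
picture.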

FIG. 1 is a block diagram showing a structure of a moving image
encoding apparatus which realizes a conventional moving image
encoding method. The moving image encoding apparatus includes a
picture memory 101, a prediction residual encoding unit 102, a bit
stream generation unit 103, a prediction residual decoding unit
104, a deblocking unit 105, a picture memory 106, an encoding mode
controlling unit 107, an inter prediction image generation unit
108 and an intra prediction image generation unit 109.

A moving image to be encoded is inputted to the picture memory
101 on a picture-by-picture basis in display order, and the
pictures are sorted into encoding order. Further, each of the
pictures is divided into macro blocks and the following processing
is applied to each macro block.
There are mainly two types of encoding methods: inter prediction
encoding and intra prediction encoding. Inter prediction encoding
is explained first.

An input image signal read out from the picture memory 101 is
inputted to a difference arithmetic unit 110, and a difference
image signal, obtained by calculating a difference from the
prediction image signal that is an output of the inter prediction
image generation unit 108, is outputted to the prediction residual
encoding unit 102. The prediction residual encoding unit 102
performs image encoding processing such as frequency conversion
and quantization so as to output a residual signal. The residual
signal is inputted to the prediction residual decoding unit 104,
and image decoding processing such as inverse quantization and
inverse frequency conversion is performed so as to output a
residual decoded signal. A sum arithmetic unit 111 adds the
residual decoded signal and the prediction image signal so as to
generate a reconstructed image signal. The reconstructed image
signal is processed by the deblocking unit 105 so as to reduce
distortion which occurs at the boundaries between the blocks into
which the picture was divided for encoding, before being stored
as a reference picture into the picture memory 106.
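
The signal path through units 110, 102, 104 and 111 can be
sketched as follows. This is only an illustration under stated
assumptions: the frequency conversion step is omitted, the
quantizer is a uniform one with an arbitrary step size, and the
function name is hypothetical; none of these details are given in
this specification.

```python
QSTEP = 8  # assumed quantizer step size, chosen only for illustration


def encode_block(block, prediction, qstep=QSTEP):
    """Return (residual signal, reconstructed signal) for one block."""
    # Difference arithmetic unit 110: input minus prediction.
    diff = [b - p for b, p in zip(block, prediction)]
    # Prediction residual encoding unit 102: quantization only
    # (frequency conversion is left out of this sketch).
    residual = [round(d / qstep) for d in diff]
    # Prediction residual decoding unit 104: inverse quantization.
    decoded = [r * qstep for r in residual]
    # Sum arithmetic unit 111: prediction plus decoded residual.
    recon = [p + d for p, d in zip(prediction, decoded)]
    return residual, recon


residual, recon = encode_block([101, 104, 98, 120], [96, 96, 96, 96])
```

The reconstructed values differ from the input wherever the
quantizer discards information, which is why the encoder keeps
the reconstructed picture, not the input picture, as the
reference.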

On the other hand, the input image signal read out from the
picture memory 101 on a macro block-by-block basis is also
inputted to the inter prediction image generation unit 108. Here,
targeting one or more encoded pictures stored in the picture
memory 106, an image area which is closest to the input image
signal is detected and outputted as a prediction image. The
prediction image is used for generating a difference image signal
in the difference arithmetic unit 110 and for generating a
reconstructed image signal in the sum arithmetic unit 111.

The bit stream generation unit 103 performs variable length
encoding on the various encoded information outputted by the
above series of processing so as to obtain a bit stream (moving
image encoded data) to be outputted by the encoding processing.

While the above flow of processing is the operation in the case
where inter prediction encoding is performed, it is switched to
intra prediction encoding by a switch 112. Intra prediction
encoding is explained next.

The input image signal read out from the picture memory 101 is
inputted to the difference arithmetic unit 110, and the difference
image signal, obtained by calculating a difference from the
prediction image signal that is an output of the intra prediction
image generation unit 109, is outputted to the prediction residual
encoding unit 102. The prediction residual encoding unit 102
performs image encoding processing such as frequency conversion
and quantization so as to output a residual signal. The residual
signal is inputted to the prediction residual decoding unit 104,
and image decoding processing such as inverse quantization and
inverse frequency conversion is performed so as to output the
residual decoded signal. The sum arithmetic unit 111 adds the
residual decoded signal and the prediction image signal, and
generates a reconstructed image signal. The reconstructed image
signal is processed by the deblocking unit 105 so as to reduce
distortion which occurs at the boundaries between the blocks into
which the picture was divided for encoding.

On the other hand, the input image signal read out from the
picture memory 101 on a macro block-by-block basis is also
inputted to the intra prediction image generation unit 109. Here,
a prediction image is generated by referring to the reconstructed
image signal of one or more neighboring blocks in the same
picture generated as an output of the sum arithmetic unit 111.
The prediction image is used for generating a difference image
signal in the difference arithmetic unit 110 and for generating a
reconstructed image signal in the sum arithmetic unit 111.
The bit stream generation unit 103 performs variable length
encoding on the various encoded information outputted by the
series of processing so that a bit stream to be outputted by the
encoding processing is obtained.

The encoding modes of inter prediction encoding and intra
prediction encoding are controlled by the encoding mode
controlling unit 107 and switched on a macro block-by-block basis.

FIG. 2 is a block diagram showing a structure of a moving image
decoding apparatus which realizes a conventional moving image
decoding method. The moving image decoding apparatus includes a
bit stream analyzing unit 201, a prediction residual decoding unit
202, a deblocking unit 203, a picture memory 204, a decoding mode
controlling unit 205, an inter prediction image generation unit
206 and an intra prediction image generation unit 207.

First, the bit stream analyzing unit 201 extracts various
information from the inputted bit stream (moving image encoded
data), and the information relating to a decoding mode and the
residual encoded signal are outputted to the decoding mode
controlling unit 205 and the prediction residual decoding unit
202, respectively.

There are two types of decoding methods: inter prediction
decoding and intra prediction decoding. Inter prediction decoding
is explained first.

The prediction residual decoding unit 202 performs image
decoding processing such as inverse quantization and inverse
frequency conversion on the inputted residual encoded signal, and
outputs the residual decoded signal. A sum arithmetic unit 208
adds the residual decoded signal and the prediction image signal
outputted from the inter prediction image generation unit 206, and
generates a decoded image signal. The deblocking unit 203
performs processing on the decoded image signal for reducing
distortion which occurs at boundaries between blocks, before the
decoded image signal is stored into the picture memory 204 as a
picture for reference or display.

On the other hand, the inter prediction image generation unit
206 takes out a specified image area from one or more decoded
pictures stored in the picture memory 204, and generates a
prediction image. The prediction image is used for generating a
decoded image signal by the sum arithmetic unit 208.

The decoded image generated by the series of processing is
outputted as an image signal for display from the picture memory
204 according to the timing at which it is to be displayed.

While the above flow of processing is the operation in the case
where inter prediction decoding is performed, it is switched to
intra prediction decoding by a switch 209. Intra prediction
decoding is explained next.

The prediction residual decoding unit 202 performs image
decoding processing such as inverse quantization and inverse
frequency conversion on the inputted residual encoded signal, and
outputs a residual decoded signal. The sum arithmetic unit 208
adds the residual decoded signal and the prediction image signal
outputted from the intra prediction image generation unit 207, and
generates a decoded image signal. The deblocking unit 203
performs processing on the decoded image signal for reducing
distortion which occurs at boundaries between blocks, before the
decoded image signal is stored in the picture memory 204 as a
picture for display.

On the other hand, the intra prediction image generation unit
207 generates a prediction image by referring to a decoded image
signal of one or more neighboring blocks in the same picture
generated as an output of the sum arithmetic unit 208. The
prediction image is used for generating the decoded image signal
in the sum arithmetic unit 208.

The decoded image generated by the series of processing is
outputted as an image signal for display from the picture memory
204 according to the timing at which it is to be displayed.

Note that the decoding modes of inter prediction decoding and
intra prediction decoding are controlled by the decoding mode
controlling unit 205 and switched on a macro block-by-block basis.

Next, the processing in the deblocking units 105 and 203 is
explained in detail. Here, the details of the processing in the
encoding processing and in the decoding processing are exactly
identical to each other. Therefore, they are explained together
as the same processing.

FIGS. 3A and 3B are drawings for explaining a method of
determining the types of filters used for deblocking. Here, as an
example, it is assumed that there are five types of filters, which
are switched according to the characteristics of a block
boundary. A stronger filter (here, Filter 4) is applied to a
portion where block distortion is more likely to be conspicuous,
and a weaker filter (here, Filter 0) is applied to a portion where
block distortion is less likely to be conspicuous.

FIG. 3A is a drawing showing a boundary between blocks to which
filters are applied. In the drawing, the center line indicates a
boundary between blocks; the pixel on the right side shown as Q
indicates a pixel adjacent to the boundary in the target block;
and the pixel on the left side shown as P indicates a pixel
adjacent to the boundary in the adjacent block. FIG. 3B is a
table showing which filter is selected according to the conditions
satisfied by the pixel P and the pixel Q shown in FIG. 3A. For
example, Filter 4 is selected in the case where the boundary is at
a vertical edge and one of the pixels P and Q belongs to a block
which is intra prediction encoded. Similarly, Filter 3 is
selected in the case where the boundary is at a horizontal edge
and one of the pixels P and Q belongs to a block which is intra
prediction encoded. Also, Filter 2 is selected in the case where
one of the pixels P and Q belongs to a block which has a non-zero
coefficient of a spatial frequency component obtained by
frequency conversion. Further, Filter 1 is selected in the case
where the pixels P and Q belong to blocks which are inter
prediction encoded and which refer to different pictures or use
different motion vectors. Furthermore, Filter 0 is selected in
the case where none of the above conditions applies.

Here, the table of FIG. 3B shows only one example of a method of
selecting filters. The number of filters and the selection
conditions are not limited to this example, and other cases can
be treated similarly.
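
The example selection rule of FIG. 3B can be written out as a
short sketch. The `BlockInfo` type, its field names and the
function name are hypothetical and serve only to carry the
conditions stated above; the conditions are assumed to be checked
in order from the strongest filter to the weakest.

```python
from dataclasses import dataclass


@dataclass
class BlockInfo:
    """Encoding information of the block that a pixel belongs to."""
    intra: bool                      # block is intra prediction encoded
    has_nonzero_coeff: bool = False  # block has a non-zero coefficient
    ref_picture: int = 0             # picture referred to (inter only)
    motion_vector: tuple = (0, 0)    # motion vector (inter only)


def select_filter(p, q, vertical_edge):
    """Return the filter number (0 to 4) for the boundary between P and Q."""
    if p.intra or q.intra:
        return 4 if vertical_edge else 3
    if p.has_nonzero_coeff or q.has_nonzero_coeff:
        return 2
    # Both blocks are inter prediction encoded at this point.
    if p.ref_picture != q.ref_picture or p.motion_vector != q.motion_vector:
        return 1
    return 0


intra = BlockInfo(intra=True)
inter = BlockInfo(intra=False)
moved = BlockInfo(intra=False, motion_vector=(1, 0))
```

For example, `select_filter(intra, inter, vertical_edge=True)`
selects Filter 4, while two inter blocks with identical reference
pictures and motion vectors and no coefficients fall through to
Filter 0.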

Next, the flow of deblocking processing is explained with
reference to the flowchart shown in FIG. 4. The target data is
managed separately as data for luminance and data for
chrominance. Therefore, deblocking is applied separately to each
component.

First, in order to perform deblocking on the luminance
components, loop processing is repeated as many times as the
number of pixels of the luminance component adjacent to a target
block boundary (F1 and F4); in each loop, a type of filter
explained with reference to FIGS. 3A and 3B is selected (F2) and
the filter is applied (F3). The information on the type of the
selected filter is used for filtering the target pixel of the
luminance component, and is also stored in a memory region where
it can be referred to in later processing (F5). Since, for each
pixel, a boundary at a vertical edge on the left side and a
boundary at a horizontal edge on the upper side are targeted, the
above mentioned processing is applied eight times in the case of
a block which is, for example, made up of horizontal 4 x vertical
4 pixels.

Next, in order to perform deblocking on the chrominance
components, loop processing is repeated as many times as the
number of pixels of the chrominance components adjacent to the
target block boundary (F6 and F10); a type of filter is selected
in each loop (F8); and the selected filter is applied (F9). Here,
the filter to be applied is determined according to the type of
the filter used for the luminance component. Specifically, the
type of the filter applied at the position of the pixel of the
corresponding luminance component is read out from the memory
region in which the information on the type of filter determined
in the processing for the luminance component is stored, and is
used. Here, the following equations are used for converting the
position of a target pixel of the chrominance component to the
position of the corresponding pixel of the luminance component
(F7). Note that XL indicates a horizontal coordinate value of the
luminance, XC indicates a horizontal coordinate value of the
chrominance, YL indicates a vertical coordinate value of the
luminance, and YC indicates a vertical coordinate value of the
chrominance.

XL = 2 × XC (equation 1(a))
YL = 2 × YC (equation 1(b))

Thus, deblocking is performed on the chrominance component by
applying a filter determined by the above mentioned processing.
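
The conversion of equations 1(a) and 1(b) and the lookup of the
stored filter type (F7 and F8) can be sketched as follows. The
function names and the use of a dictionary keyed by luminance
position as the memory region are assumptions made for
illustration.

```python
def luma_position(xc, yc):
    """Equations 1(a) and 1(b): chrominance position to luminance position."""
    return 2 * xc, 2 * yc


def chroma_filter_type(luma_filter_types, xc, yc):
    """Return the filter type stored for the corresponding luminance pixel."""
    xl, yl = luma_position(xc, yc)
    return luma_filter_types[(xl, yl)]


# Filter types recorded during luminance deblocking (step F5);
# the values here are arbitrary sample data.
stored = {(6, 2): 4, (6, 4): 1}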

Next, the relationship between the luminance components and the
chrominance components is explained. FIGS. 5A-5C are drawings
for explaining positional relationships between the luminance
components and the chrominance components. In the drawings, an x
mark indicates a sample position of the luminance component and
an O mark indicates a sample position of the chrominance
component.

In general, human eyes are insensitive to changes in the
chrominance components. Therefore, the chrominance components
are often decimated before use. While there are various
decimation methods, FIG. 5A indicates the positional relationship
in the case where the chrominance components are decimated to
half in both the vertical and horizontal directions. FIG. 5B
indicates the positional relationship in the case where the
chrominance components are decimated to half only in the
horizontal direction. FIG. 5C indicates the positional
relationship in the case where decimation is not performed. In
the case of the positional relationship shown in FIG. 5A, the
equation 1(a) and the equation 1(b) are used for calculating the
pixel position of the corresponding luminance component for
deblocking the chrominance component.
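
The three decimation patterns can be summarized by the size of
the chrominance plane relative to the luminance plane. In the
sketch below, the table keys reuse the figure labels, and the
function name is hypothetical; luminance dimensions are assumed
to be even.

```python
# (horizontal divisor, vertical divisor) for each decimation pattern.
DECIMATION = {
    "5A": (2, 2),  # halved in both the vertical and horizontal directions
    "5B": (2, 1),  # halved only in the horizontal direction
    "5C": (1, 1),  # no decimation
}


def chroma_size(luma_width, luma_height, figure):
    """Return (width, height) of one chrominance plane."""
    h_div, v_div = DECIMATION[figure]
    return luma_width // h_div, luma_height // v_div
```

For a 16 x 16 luminance macro block, this gives an 8 x 8
chrominance block for FIG. 5A, 8 x 16 for FIG. 5B and 16 x 16 for
FIG. 5C.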

Further, FIGS. 6A and 6B show positional relationships in a
frame structure and in a field structure in the case where the
chrominance components are decimated to half in both the vertical
and horizontal directions. FIG. 6A shows the frame structure
when processing is performed after decimating the chrominance
components. FIG. 6B shows the field structure into which the
frame structure is rearranged. Specifically, the zeroth, second
and fourth lines of the luminance components are assigned to a top
field, and the first, third and fifth lines are assigned to a
bottom field. (Refer to: ITU-T Rec. H.264 | ISO/IEC 14496-10 AVC
Draft Text of Final Draft International Standard (FDIS) of Joint
Video Specification (2003-3-31)).
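
The assignment of lines to fields can be sketched with list
slicing. The function name is hypothetical, and lines are
represented simply by their line numbers.

```python
def split_fields(lines):
    """Split frame lines into (top field, bottom field) by line parity."""
    top = lines[0::2]      # zeroth, second, fourth, ... lines
    bottom = lines[1::2]   # first, third, fifth, ... lines
    return top, bottom


top, bottom = split_fields([0, 1, 2, 3, 4, 5])
```

The same split applies to the lines of the decimated chrominance
components, so alternate chrominance lines also belong to
different fields.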

However, in the conventional structure, the type of filter used
for the luminance component at the pixel position converted using
the equation 1(a) and the equation 1(b) is applied to a pixel of
the chrominance component. Therefore, in the case where an image
to be displayed in an interlaced-scan form is encoded and decoded
in the frame structure, there is a problem of inconsistency in
that the filter to be applied to a chrominance component in the
bottom field is determined by referring to a luminance component
in the top field.

FIGS. 7A and 7B are drawings for explaining this reference
relationship. FIG. 7A shows the positional relationship between
the luminance components and the chrominance components when the
picture is encoded and decoded in the frame structure. FIG. 7B
shows the positional relationship between the luminance
components and the chrominance components when the image is
rearranged into the field structure. Here, L_0 indicates the
position of the luminance components at the zeroth line, and C_0
indicates the position of the chrominance components at the
zeroth line. When a deblocking filter is applied to the
chrominance component at C_1, the luminance component at L_2 is
referred to according to the equation 1(b). However, when the
pixels are rearranged into the field structure, it is found that
the type of filter for the chrominance component at C_1, which
belongs to the bottom field, is determined by referring to the
luminance component at L_2, which belongs to the top field.
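
The mismatch can be checked numerically. In the sketch below the
function names are hypothetical; the only relations used are
equation 1(b) and the rule that even-numbered lines belong to the
top field and odd-numbered lines to the bottom field.

```python
def field_of(line):
    """Return the field to which a line of the frame belongs."""
    return "top" if line % 2 == 0 else "bottom"


def conventional_reference(yc):
    """Return (field of C_yc, referenced luminance line, field of that line)."""
    yl = 2 * yc  # equation 1(b)
    return field_of(yc), yl, field_of(yl)


mismatched = [yc for yc in range(4) if field_of(yc) != field_of(2 * yc)]
```

Because 2 × YC is always even, equation 1(b) always refers to a
luminance line in the top field, so every odd-numbered
chrominance line (C_1, C_3, ...) is paired with a luminance line
of the other field.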

As described above, a picture having the frame structure can be
encoded not only with all of its macro blocks treated as frame
macro blocks, but also by switching between the frame structure
and the field structure on a macro block pair-by-pair basis.
Also, in the case where the picture is encoded in the field
structure, it is possible to use separate encoding modes for the
top field and the bottom field.

Accordingly, for example, in the case where an intra prediction
encoding mode is used in the top field and an inter prediction
encoding mode is used in the bottom field, picture quality is
degraded in the chrominance components in the bottom field. In
other words, a strong filter is basically applied for the intra
prediction encoding mode and a weak filter is applied for the
inter prediction encoding mode. Therefore, a weak filter should
properly be applied to the chrominance components in the bottom
field. However, as described above, the type of filter for the
chrominance components in the bottom field is determined by
referring to the luminance components in the top field.
Therefore, a strong filter is applied. Consequently, the picture
quality is degraded in the chrominance components in the bottom
field, so that the image is not consistent when it is displayed
in the interlaced-scan form.

Further, the same applies to the case where, even if the same
encoding mode is used for the top field and the bottom field of
the target macro block, an adjacent macro block is encoded in the
field structure and separate encoding modes are used for its top
field and bottom field.

As described above, in the case where an image to be displayed
in the interlaced-scan form is encoded and decoded in the frame
structure, there is a problem that an inappropriate type of filter
may be applied, because the type of filter applied to the
chrominance components is in some cases determined by referring
to the luminance components in a different field.

Disclosure of Invention 

In order to solve the conventional problems, it is an object of
the present invention to provide a moving image encoding method
and a moving image decoding method which make it possible to
apply the most appropriate type of filter for deblocking even in
the case where an image to be displayed in an interlaced-scan form
is encoded and decoded in the frame structure.

In order to achieve the aforementioned object, a moving image
encoding method according to the present invention is a moving
image encoding method for encoding an inputted picture on a
block-by-block basis after dividing the picture into blocks, the
picture including luminance components and chrominance
components, the method including: a prediction encoding step of
encoding the picture by referring to an encoded area in the
picture or a different encoded picture; a reconstructed picture
generation step of reconstructing the picture using encoded data
generated in the prediction encoding step; and a deblocking step
of reducing distortion by filtering a boundary between blocks of
the reconstructed picture generated in the reconstructed picture
generation step, wherein in the deblocking step, filters are
respectively applied to the luminance components and the
chrominance components: a type of the filters is selected based
on encoding information of each luminance component and the
selected filter is applied to the luminance component; and a
filter selected for a luminance component, which belongs to a same
field as a chrominance component and corresponds to the
chrominance component, is applied to the chrominance component.

Accordingly, the same type of deblocking filter is applied to
the luminance components and the chrominance components in the
same field. Therefore, it is possible to perform encoding so as
to generate a consistent image when the image is displayed in the
interlaced-scan form after the image is decoded. Further, it is
possible to combine the structure of the present invention easily
into the conventional structure because there are very few
changes from the conventional structure.

Also, a moving image decoding method according to the present
invention is a moving image decoding method for decoding encoded
moving image data obtained by encoding, on a block-by-block
basis, a picture which is divided into blocks, the picture
including luminance components and chrominance components, the
method including: a prediction decoding step of decoding the
picture by referring to a decoded area in the picture or a
different decoded picture; and a deblocking step of reducing
distortion by filtering a boundary between blocks of the decoded
picture generated in said prediction decoding step, wherein in
the deblocking step, filters are respectively applied to the
luminance components and the chrominance components: a type of
the filters is selected based on encoding information of each
luminance component and the selected filter is applied to the
luminance component; and a filter selected for a luminance
component, which belongs to a same field as a chrominance
component and corresponds to the chrominance component, is
applied to the chrominance component.

Accordingly, the same type of deblocking filter is applied to
the luminance components and the chrominance components in the
same field. Further, it is possible to combine the structure of
the present invention easily into the conventional structure
because there are very few changes from the conventional
structure.

Note that the present invention can be realized not only as such
a moving image encoding method and moving image decoding method,
but also as a moving image encoding apparatus and a moving image
decoding apparatus having, as units, the characteristic steps
included in the moving image encoding method and the moving image
decoding method. Further, it can be realized as a program for
causing a computer to execute those steps and as moving image
data encoded by the moving image encoding method. Furthermore,
such a program and such encoded moving image data can be
distributed via a recording medium such as a CD-ROM or a
transmission medium such as the Internet.

According to the moving image encoding method of the present
invention, the same type of deblocking filter is applied to the
luminance components and the chrominance components in the same
field. Therefore, it is possible to perform encoding so as to
generate a consistent image when the image is displayed in the
interlaced-scan form after the image is decoded. Further, it is
possible to combine the structure of the present invention easily
into the conventional structure because there are very few
changes from the conventional structure.

In addition, according to the moving image decoding method of
the present invention, the same type of deblocking filter is
applied to the luminance components and the chrominance
components in the same field. Therefore, it is possible to
generate a consistent image in the case of displaying the image
in the interlaced-scan form. Further, it is possible to combine
the structure of the present invention easily into the
conventional structure because there are very few changes from
the conventional structure.

Brief Description of Drawings 

FIG. 1 is a block diagram showing a structure of a
conventional moving image encoding apparatus.

FIG. 2 is a block diagram showing a structure of a
conventional moving image decoding apparatus.

FIGS. 3A and 3B are schematic drawings showing a method of
determining types of deblocking filters.

FIG. 4 is a flowchart for explaining a flow of deblocking 
processing in the conventional moving image encoding apparatus 
and the moving image decoding apparatus. 

FIGS. 5A-5C are schematic drawings showing a relationship of
sample positions of chrominance components and luminance
components.

FIGS. 6A and 6B are other schematic drawings showing a
relationship of sample positions of chrominance components and
luminance components.

FIGS. 7A and 7B are schematic drawings showing a relationship
when the pixel positions of the chrominance components are
converted to the pixel positions of the luminance components.

FIG. 8 is a flowchart for explaining a flow of deblocking
processing in the moving image encoding apparatus and the moving
image decoding apparatus in a first embodiment.

FIGS. 9A and 9B are other schematic drawings showing a
relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 10A and 10B are flowcharts for explaining flows of other
deblocking processing in the moving image encoding apparatus and
the moving image decoding apparatus in the first embodiment.

FIGS. 11A and 11B are other schematic drawings showing a
relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 12A and 12B are other schematic drawings showing a
relationship when the pixel positions of the chrominance
components are converted to the pixel positions of the luminance
components.

FIGS. 13A-13C are drawings for explaining a recording medium for
storing a program for causing a computer system to execute the
moving image encoding method and the moving image decoding method
in each embodiment: FIG. 13A is an illustration showing an example
of a physical format of a flexible disk which is a recording medium
body; FIG. 13B is an illustration showing an external front view of
the flexible disk, its cross-sectional structure, and the flexible
disk itself; and FIG. 13C is an illustration showing a structure
for recording and reproducing the program on the flexible disk FD.

FIG. 14 is a block diagram showing an overall structure of a
content supply system for realizing content distribution service.

FIG. 15 is a schematic diagram showing an example of a cell phone.

FIG. 16 is a block diagram showing an internal structure of the
cell phone.

FIG. 17 is a block diagram showing an overall structure of a
system for digital broadcast.

Best Mode for Carrying Out the Invention

Specific embodiments of the present invention are explained below
with reference to the drawings.

(First Embodiment)

The structure of the moving image encoding apparatus with respect
to the overall flow of encoding processing in the first embodiment
of the present invention is exactly the same as the conventional
structure explained using FIG. 1, so its explanation is omitted
here. The moving image encoding apparatus differs from the
conventional structure only in the deblocking method used by the
deblocking unit 105 shown in FIG. 1. Likewise, the structure of the
moving image decoding apparatus with respect to the overall flow of
decoding processing is exactly the same as the conventional
structure explained using FIG. 2, so its explanation is omitted
here. It differs from the conventional structure only in the
deblocking method used by the deblocking unit 203 shown in FIG. 2.
The processing in the deblocking units 105 and 203 is explained in
detail below. The processing details are exactly the same in the
encoding processing and in the decoding processing, so they are
explained together.

The deblocking processing differs depending on the method of
decimating the chrominance components in the moving image data.
Three decimation cases are explained here: (1) the case where, as
shown in FIG. 5A, moving image data in which the chrominance
components are decimated to half in both the vertical and
horizontal directions is inputted; (2) the case where, as shown in
FIG. 5B, moving image data in which the chrominance components are
decimated to half only in the horizontal direction is inputted; and
(3) the case where, as shown in FIG. 5C, moving image data in which
the chrominance components are not decimated is inputted. Note
that, in the following explanation, it is assumed that the target
block is encoded and decoded in a frame structure.

(1) half decimation in vertical and horizontal directions

FIG. 8 is a flowchart for explaining a flow of deblocking
processing in the case where a moving image in which the
chrominance components are decimated to half in both the vertical
and horizontal directions is inputted. The target moving image data
is managed separately as luminance components and chrominance
components. Therefore, deblocking is performed independently on
each of the components.

First, to perform deblocking on the luminance components, loop
processing is repeated as many times as the number of pixels of the
luminance components adjacent to the target block boundary (F1 and
F4). In each loop, a type of filter is selected using the encoding
information of the pixel of the target luminance component (F2),
and the selected filter is applied (F3). The information about the
type of filter selected here is used for filtering the pixel of the
target luminance component and is also stored in a memory region
where it can be referred to in later processing (F5). Since the
boundary formed by the left vertical edge and the boundary formed
by the upper horizontal edge are targeted for each block, the
processing is repeated eight times in the case of, for example, a
block made up of horizontal 4 x vertical 4 pixels.
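The iteration count above can be checked with a short sketch. It
assumes, as the text states, that only the left vertical edge and the
upper horizontal edge of each block are targeted and that one loop
iteration is spent per pixel adjacent to those edges. The function
name is illustrative and does not appear in the specification.

```python
def boundary_pixel_count(width: int, height: int) -> int:
    """Number of loop iterations for one block (illustrative helper).

    Assumes one iteration per pixel adjacent to the left vertical edge
    (one pixel per row) plus one iteration per pixel adjacent to the
    upper horizontal edge (one pixel per column).
    """
    left_edge = height   # one pixel in each row touches the left edge
    upper_edge = width   # one pixel in each column touches the upper edge
    return left_edge + upper_edge


print(boundary_pixel_count(4, 4))  # 8
```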

A method for determining the types of filters is explained with
reference to FIGS. 3A and 3B. Here, assuming as an example that
there are five types of filters, the filters are switched depending
on the characteristics of the block boundary. A stronger filter
(here indicated as Filter 4) is applied to a portion where block
distortion is more likely to be prominent, and a weaker filter
(here indicated as Filter 0) is applied to a portion where block
distortion is less likely to be prominent.

FIG. 3A is a drawing showing a boundary between blocks to which
filters are applied. In the drawing, the center line indicates the
boundary between blocks, the pixel on the right side shown as Q
indicates a pixel adjacent to the boundary in the target block, and
the pixel on the left side shown as P indicates a pixel adjacent to
the boundary in the adjacent block. FIG. 3B is a table showing
which filter is selected according to the conditions satisfied by
the pixel P and the pixel Q shown in FIG. 3A. For example, Filter 4
is selected in the case where the boundary is at a vertical edge
and one of the pixel P and the pixel Q belongs to a block which is
intra prediction encoded.

Note that the table of FIG. 3B shows only one example of a method
of selecting filters; the number of filters and the selection
conditions are not limited to this example. Therefore, other cases
can be treated similarly.

The information about the type of filter determined by the above
processing is used for filtering the target pixel of the luminance
component and is also stored in the memory region where it can be
referred to in later processing.

Next, to perform deblocking on the chrominance components, loop
processing is repeated as many times as the number of pixels of the
chrominance components adjacent to the target block boundary (F6
and F10); a type of filter is selected in each loop (F8); and the
selected filter is applied (F9). Here, the filter to be applied to
the chrominance component is determined based on the type of filter
used for the luminance component. Specifically, the type of filter
applied at the pixel position of the corresponding luminance
component is read from the memory region in which the information
about the type of filter determined in the processing for the
luminance component is stored, and that type is used. For this
purpose, the pixel position of the target chrominance component is
converted to the pixel position of the corresponding luminance
component (F7a).
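The two loops of FIG. 8 can be sketched as follows. This is a minimal
illustration under stated assumptions, not the apparatus itself: the
dictionary stands in for the memory region, and `select_filter`,
`apply_filter` and `to_luma_position` are hypothetical callables
supplied by the caller, since the specification does not name such
functions.

```python
from typing import Callable, Dict, Iterable, Tuple

Position = Tuple[int, int]  # (x, y) pixel coordinates


def deblock_block(
    luma_pixels: Iterable[Position],
    chroma_pixels: Iterable[Position],
    select_filter: Callable[[Position], int],
    apply_filter: Callable[[str, Position, int], None],
    to_luma_position: Callable[[Position], Position],
) -> Dict[Position, int]:
    """Filter luminance first, then reuse its filter types for chrominance."""
    filter_memory: Dict[Position, int] = {}  # stands in for the memory region

    # Luminance loop: select a filter type, apply it, and remember it.
    for pos in luma_pixels:
        filter_type = select_filter(pos)
        apply_filter("luma", pos, filter_type)
        filter_memory[pos] = filter_type

    # Chrominance loop: convert the position, look up the filter type
    # stored for the corresponding luminance pixel, and apply it.
    for pos in chroma_pixels:
        filter_type = filter_memory[to_luma_position(pos)]
        apply_filter("chroma", pos, filter_type)

    return filter_memory
```

Passing the position conversion in as a parameter keeps the loop
structure identical for every decimation method; only the conversion
step changes.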

The pixel position is converted so that, when the target block is
rearranged into the field structure, the resulting pixel of the
luminance component is assigned to the same field as the target
pixel of the chrominance component. Here, since the chrominance
components are decimated to half in both the vertical and
horizontal directions, the conversion is performed using the
following equations. Note that the symbols indicate the following:
XL indicates a horizontal coordinate value of the luminance; XC
indicates a horizontal coordinate value of the chrominance; YL
indicates a vertical coordinate value of the luminance; and YC
indicates a vertical coordinate value of the chrominance. Further,
the symbol % in equation 2(b) indicates the operator that returns
the remainder of a division.

XL = 2 × XC (equation 2(a))

YL = 2 × YC + YC % 2 (equation 2(b))
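As a worked check of equations 2(a) and 2(b), the following sketch
converts a few chrominance lines and tests whether the resulting
luminance line has the same parity. It assumes that even-numbered
lines of a frame belong to the top field and odd-numbered lines to
the bottom field; the function name is illustrative.

```python
from typing import Tuple


def chroma_to_luma_half_both(xc: int, yc: int) -> Tuple[int, int]:
    """Equations 2(a) and 2(b): half decimation in both directions."""
    xl = 2 * xc            # equation 2(a)
    yl = 2 * yc + yc % 2   # equation 2(b); % returns the remainder
    return xl, yl


for yc in range(4):
    _, yl = chroma_to_luma_half_both(0, yc)
    # Same parity means same field under the even/odd assumption above.
    print(f"C_{yc} -> L_{yl}, same field: {yc % 2 == yl % 2}")
# C_0 -> L_0, same field: True
# C_1 -> L_3, same field: True
# C_2 -> L_4, same field: True
# C_3 -> L_7, same field: True
```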

Deblocking is performed on the chrominance components by applying
the filter determined through the above processing. The boundary
formed by the left vertical edge and the boundary formed by the
upper horizontal edge are targeted for each block. Therefore, in
the case where, for example, the luminance components form a block
made up of horizontal 4 x vertical 4 pixels, the chrominance
components form a block of horizontal 2 x vertical 2 pixels,
because this example decimates the chrominance components to half
in both the vertical and horizontal directions. Consequently, the
above processing is repeated four times.

FIGS. 9A and 9B are drawings for explaining a relationship
between positions of pixels of target chrominance components and
positions of pixels of reference luminance components that are
converted using equations 2(a) and 2(b). FIG. 9A shows a
positional relationship between the luminance components and the
chrominance components in the frame structure. FIG. 9B shows a
positional relationship between the luminance components and the
chrominance components in the case where the image is rearranged
into the field structure. Here, L_0 indicates the position of the
luminance components at the zeroth line and C_0 indicates the
position of the chrominance components at the zeroth line. In the
case where a deblocking filter is applied to the chrominance
components of C_1, equation 2(b) indicates that the luminance
components of L_3 are referred to. When the pixels are rearranged
into the field structure, it is found that the chrominance
components of C_1 in the bottom field refer to the luminance
components of L_3 in the same bottom field to determine the type of
filter.

In the conventional method, as explained using FIGS. 7A and 7B,
when the deblocking filter is applied to the chrominance components
of C_1, the luminance components of L_2 are referred to using
equation 1(b). In deblocking, the type of filter applied may differ
between fields in the case where the target block is in a frame
structure but the adjacent block is in a field structure. That is,
the result for the chrominance components of C_1 differs greatly
depending on whether the filter used for the luminance components
in the top field or the filter used for the luminance components in
the bottom field is applied. Consequently, a discrepancy arises in
the degree of correction of pixel values, because different filters
are used for the luminance components and the chrominance
components in the bottom field.
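The mismatch can be made concrete with a small comparison. The sketch
assumes that equation 1(b) of the conventional method has the form
YL = 2 × YC, which is what the reference from C_1 to L_2 implies, and
that odd-numbered lines of a frame belong to the bottom field. The
function names are illustrative.

```python
def field_of(line: int) -> str:
    """Field of a frame line, assuming even = top and odd = bottom."""
    return "top" if line % 2 == 0 else "bottom"


def conventional_luma_line(yc: int) -> int:
    return 2 * yc            # assumed form of equation 1(b)


def field_matched_luma_line(yc: int) -> int:
    return 2 * yc + yc % 2   # equation 2(b)


yc = 1  # chrominance line C_1, which lies in the bottom field
print(
    field_of(yc),
    field_of(conventional_luma_line(yc)),
    field_of(field_matched_luma_line(yc)),
)
# bottom top bottom
```

Under these assumptions the conventional conversion takes the filter
type for a bottom-field chrominance line from a top-field luminance
line, whereas equation 2(b) stays within the bottom field.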

However, by using the conversion method according to the present
invention, the same type of deblocking filter is used for the
luminance component and the chrominance component which make up a
pixel in the same field. Therefore, it is possible to perform
encoding and decoding so as to generate a consistent image when the
image is displayed in the interlaced-scan form.

Here, instead of equations 2(a) and 2(b), the following equations
can also be used to convert the pixel position of a target
chrominance component into the pixel position of the corresponding
luminance component. Note that XL indicates a horizontal
coordinate value of the luminance; XC indicates a horizontal
coordinate value of the chrominance; YL indicates a vertical
coordinate value of the luminance; and YC indicates a vertical
coordinate value of the chrominance.

XL = 2 × XC (equation 2(a1))

YL = 2 × YC (equation 2(b1))

XL = 2 × XC (equation 2(a2))

YL = 2 × YC + 1 (equation 2(b2))

In the case where the chrominance component belongs to the top
field, the conversion is performed using equations 2(a1) and 2(b1),
and in the case where the chrominance component belongs to the
bottom field, the conversion is performed using equations 2(a2) and
2(b2).
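The field-dependent vertical conversion can be sketched as a single
function with the field passed in explicitly. The check at the end
assumes frame line numbering in which odd-numbered chrominance lines
belong to the bottom field; under that assumption the result agrees
with equation 2(b). The function name is illustrative.

```python
def luma_line_by_field(yc: int, chroma_in_bottom_field: bool) -> int:
    """Equation 2(b1) for the top field, equation 2(b2) for the bottom field."""
    if chroma_in_bottom_field:
        return 2 * yc + 1   # equation 2(b2)
    return 2 * yc           # equation 2(b1)


# With odd lines assumed to be bottom-field lines, both forms agree.
for yc in range(8):
    assert luma_line_by_field(yc, yc % 2 == 1) == 2 * yc + yc % 2
```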

(2) half decimation in a horizontal direction 

FIG. 10A is a flowchart for explaining a flow of deblocking
processing in the case where an image in which the chrominance
components are decimated to half only in the horizontal direction
is inputted. It differs from FIG. 8 only in the deblocking for the
chrominance components. Therefore, explanation of the processing
that is the same is omitted here.

To perform deblocking on the chrominance components, loop
processing is repeated as many times as the number of pixels of the
chrominance components adjacent to the target block boundary (F6
and F10); a type of filter is selected in each loop (F8); and the
selected filter is applied (F9). Here, the filter to be applied to
the chrominance component is determined based on the type of filter
used for the luminance component. Specifically, the type of filter
applied at the pixel position of the corresponding luminance
component is read from the memory region in which the information
about the type of filter determined by the processing performed on
the luminance component is stored, and that type is used. For this
purpose, the pixel position of the target chrominance component is
converted to the pixel position of the corresponding luminance
component (F7b).

The pixel position is converted so that, when the target block is
rearranged into the field structure, the resulting pixel of the
luminance component is assigned to the same field as the pixel of
the target chrominance component. Here, the chrominance components
are decimated to half only in the horizontal direction. Therefore,
the conversion is performed using the following equations. Note
that XL indicates a horizontal coordinate value of the luminance;
XC indicates a horizontal coordinate value of the chrominance; YL
indicates a vertical coordinate value of the luminance; and YC
indicates a vertical coordinate value of the chrominance.

XL = 2 × XC (equation 3(a))

YL = YC (equation 3(b))
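Equations 3(a) and 3(b) can be written as a short sketch. Because the
vertical coordinate is passed through unchanged, the luminance line
trivially lies in the same field as the chrominance line; the loop
confirms this under the assumption that line parity determines the
field. The function name is illustrative.

```python
from typing import Tuple


def chroma_to_luma_half_horizontal(xc: int, yc: int) -> Tuple[int, int]:
    """Equations 3(a) and 3(b): half decimation in the horizontal direction only."""
    xl = 2 * xc   # equation 3(a)
    yl = yc       # equation 3(b)
    return xl, yl


for yc in range(4):
    xl, yl = chroma_to_luma_half_horizontal(1, yc)
    assert xl == 2 and yl % 2 == yc % 2   # same line parity, same field
```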

Deblocking is performed on the chrominance components by applying
the filter determined through the above processing. For each block,
the boundary made up of the vertical edge on the left side and the
boundary made up of the horizontal edge on the upper side are
targeted. Therefore, in the case where the luminance component is a
block made up of, for example, horizontal 4 x vertical 4 pixels,
the chrominance component is horizontal 2 x vertical 4 pixels,
because this example decimates the chrominance components to half
only in the horizontal direction. Consequently, the above
processing is repeated six times.

FIGS. 11A and 11B are drawings for explaining a relationship
between positions of pixels of target chrominance components and
positions of pixels of reference luminance components converted
using equations 3(a) and 3(b). FIG. 11A shows a positional
relationship between the luminance components and the chrominance
components in the frame structure. FIG. 11B shows a positional
relationship between the luminance components and the chrominance
components in the case where the image is rearranged into the field
structure. Here, L_0 indicates the position of the luminance
components at the zeroth line and C_0 indicates the position of the
chrominance components at the zeroth line. In the case of applying
the deblocking filter to the chrominance components of C_1,
equation 3(b) indicates that the luminance components of L_1 are
referred to. When the pixels are rearranged into the field
structure, it is found that the type of filter for the chrominance
components of C_1 in the bottom field is determined by referring to
the luminance components of L_1 in the same bottom field.

Using the above conversion method, the same type of deblocking
filter is applied to the luminance components and the chrominance
components which make up the same pixel in the same field.
Therefore, it is possible to perform encoding and decoding so as to
generate a consistent image when the image is displayed in the
interlaced-scan form.

(3) no decimation 

FIG. 10B is a flowchart for explaining a flow of deblocking
processing in the case where an image in which the chrominance
components are not decimated is inputted. It differs from FIG. 8
only in the deblocking performed on the chrominance components.
Therefore, explanation of the processing that is the same is
omitted here.

To perform deblocking on the chrominance components, loop
processing is repeated as many times as the number of pixels of the
chrominance components adjacent to the target block boundary (F6
and F10); a type of filter is selected in each loop (F8); and the
selected filter is applied (F9). Here, the filter to be applied to
the chrominance components is determined based on the type of
filter used for the luminance components. That is, the type of
filter applied at the pixel position of the corresponding luminance
component is read from the memory region in which the information
about the types of filters determined by the processing performed
on the luminance component is stored, and that type is used. For
this purpose, the pixel position of the target chrominance
component is converted to the pixel position of the corresponding
luminance component (F7c).

The pixel position is converted so that, when the target block is
rearranged into the field structure, the resulting pixel of the
luminance component is assigned to the same field as the pixel of
the target chrominance component. Here, since the chrominance
components are not decimated, the conversion is performed using the
following equations. Note that the symbols indicate the following:
XL indicates a horizontal coordinate value of the luminance; XC
indicates a horizontal coordinate value of the chrominance; YL
indicates a vertical coordinate value of the luminance; and YC
indicates a vertical coordinate value of the chrominance.

XL = XC (equation 4(a))

YL = YC (equation 4(b))
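With all three conversions now stated, they can be gathered into one
dispatch function. This is only a sketch: the enumeration and function
names are hypothetical, and the specification describes the three
cases as separate steps (F7a, F7b, F7c) rather than as one routine.

```python
from enum import Enum
from typing import Tuple


class Decimation(Enum):
    BOTH = "vertical and horizontal"   # FIG. 5A, equations 2(a) and 2(b)
    HORIZONTAL = "horizontal only"     # FIG. 5B, equations 3(a) and 3(b)
    NONE = "none"                      # FIG. 5C, equations 4(a) and 4(b)


def chroma_to_luma(xc: int, yc: int, mode: Decimation) -> Tuple[int, int]:
    """Convert a chrominance pixel position to a luminance pixel position."""
    if mode is Decimation.BOTH:
        return 2 * xc, 2 * yc + yc % 2
    if mode is Decimation.HORIZONTAL:
        return 2 * xc, yc
    return xc, yc


for mode in Decimation:
    print(mode.name, chroma_to_luma(1, 1, mode))
# BOTH (2, 3)
# HORIZONTAL (2, 1)
# NONE (1, 1)
```

In every mode the luminance line has the same parity as the
chrominance line, which is the property the three conversions share.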

Deblocking is performed on the chrominance components by applying
the filter determined through the above processing. For each block,
the boundary made up of the vertical edge on the left side and the
boundary made up of the horizontal edge on the upper side are
targeted. Therefore, in the case where the luminance component is a
block of, for example, horizontal 4 x vertical 4 pixels, the
chrominance component is also horizontal 4 x vertical 4 pixels,
because in this example the chrominance components are not
decimated. Consequently, the processing is repeated eight times.

FIGS. 12A and 12B are drawings for explaining a relationship
between positions of pixels of target chrominance components and
positions of pixels of reference luminance components that are
converted using equations 4(a) and 4(b). FIG. 12A shows a
positional relationship between the luminance components and the
chrominance components in the frame structure. FIG. 12B shows a
positional relationship between the luminance components and the
chrominance components in the case where the image is rearranged
into the field structure. Here, L_0 indicates the position of the
luminance components at the zeroth line, and C_0 indicates the
position of the chrominance components at the zeroth line. In the
case where the deblocking filter is applied to the chrominance
components of C_1, equation 4(b) indicates that the luminance
components of L_1 are referred to. When the pixels are rearranged
into the field structure, it is found that the type of filter
applied to the chrominance component of C_1 in the bottom field is
determined by referring to the luminance component of L_1 in the
same bottom field.

Using the above conversion method, the same type of deblocking
filter is applied to the luminance components and the chrominance
components which make up the same pixel in the same field.
Therefore, it is possible to perform encoding and decoding so as to
generate a consistent image in the case where the image is
displayed in the interlaced-scan form.

(Second Embodiment)

In addition, by recording a program for implementing the
configuration of the moving image encoding method or the moving
image decoding method shown in each of the above-mentioned
embodiments on a recording medium such as a flexible disk, the
processing shown in the above embodiments can easily be performed
in an independent computer system.

FIGS. 13A-13C are diagrams of a recording medium for recording a
program for implementing, in a computer system, the moving image
encoding method and the moving image decoding method of the above
embodiments.

FIG. 13B shows a front view and a schematic cross-section of a
flexible disk, as well as the flexible disk itself, whereas FIG.
13A shows an example of a physical format of the flexible disk as a
recording medium body. A flexible disk FD is contained in a case F,
a plurality of tracks Tr are formed concentrically on the surface
of the disk from the periphery in the radial direction, and each
track is divided into 16 sectors Se in the angular direction.
Therefore, in the flexible disk storing the above-mentioned
program, the moving image encoding method as the above program is
recorded in an area allocated for it on the flexible disk FD.

In addition, FIG. 13C shows the configuration for recording and
reproducing the program on and from the flexible disk FD. When the
program is recorded on the flexible disk FD, the computer system Cs
writes the moving image encoding method and the moving image
decoding method as the program onto the flexible disk FD via a
flexible disk drive. When the above moving image encoding method
and moving image decoding method are constructed in the computer
system using the program recorded on the flexible disk, the program
is read out from the flexible disk via the flexible disk drive and
transferred to the computer system.

Note that the above explanation assumes that the recording medium
is a flexible disk, but the same processing can also be performed
using an optical disk. In addition, the recording medium is not
limited to these; any other medium, such as a CD-ROM, a memory card
or a ROM cassette, can be used in the same manner as long as a
program can be recorded on it.

Further, applications of the moving image encoding apparatus and
the moving image decoding apparatus illustrated in the above
embodiments, and a system using them, are explained here.

FIG. 14 is a diagram showing the overall configuration of a
content providing system ex100 for realizing content distribution
service. The area for providing communication service is divided
into cells of desired size, and base stations ex107 to ex110, which
are fixed wireless stations, are placed in the respective cells.

In this content providing system ex100, various devices such as a
computer ex111, a personal digital assistant (PDA) ex112, a camera
ex113, a cell phone ex114 and a camera-equipped cell phone ex115
are connected to the Internet ex101 via, for example, an Internet
service provider ex102, a telephone network ex104 and the base
stations ex107 to ex110.

However, the content providing system ex100 is not limited to the
combination shown in FIG. 14, and any combination of these devices
may be connected. Also, each device may be connected directly to
the telephone network ex104, not through the base stations ex107 to
ex110 which are the fixed wireless stations.

The camera ex113 is a device such as a digital video camera
capable of shooting moving images. The cell phone may be a cell
phone of any of the following systems: a Personal Digital
Communications (PDC) system, a Code Division Multiple Access (CDMA)
system, a Wideband-Code Division Multiple Access (W-CDMA) system, a
Global System for Mobile Communications (GSM) system, a Personal
Handy-phone System (PHS) and the like.

Also, a streaming server ex103 is connected to the camera ex113
via the base station ex109 and the telephone network ex104, which
realizes live distribution or the like using the camera ex113 based
on the encoded data transmitted from the user. The encoding of the
data shot by the camera may be performed by the camera ex113, the
server for transmitting the data, or the like. Also, the moving
image data shot by a camera ex116 may be transmitted to the
streaming server ex103 via the computer ex111. The camera ex116 is
a device such as a digital camera capable of shooting still and
moving images. In this case, either the computer ex111 or the
camera ex116 may encode the moving image data. An LSI ex117
included in the computer ex111 or the camera ex116 performs the
encoding processing. Note that software for encoding and decoding
moving images may be stored on any type of recording medium (such
as a CD-ROM, a flexible disk or a hard disk) that is readable by
the computer ex111 or the like. Furthermore, the camera-equipped
cell phone ex115 may transmit the moving image data. This moving
image data is the data encoded by the LSI included in the cell
phone ex115.

In this content providing system ex100, content (such as a video
of a live music performance) shot by users using the camera ex113,
the camera ex116 or the like is encoded in the same manner as in
the above embodiments and transmitted to the streaming server
ex103, while the streaming server ex103 performs stream
distribution of the above content data to the clients at their
request. The clients include the computer ex111, the PDA ex112, the
camera ex113, the cell phone ex114, and the like, which are capable
of decoding the above-mentioned encoded data. The content providing
system ex100 is a system in which the clients can thus receive and
reproduce the encoded data, and further can receive, decode and
reproduce the data in real time so as to realize personal
broadcasting.

When each apparatus in this system performs encoding or decoding,
the moving image encoding apparatus or the moving image decoding
apparatus shown in each of the above-mentioned embodiments may be
used.

A cell phone is now explained as an example thereof.
FIG. 15 is a diagram showing a cell phone ex115 which uses the
moving image encoding method and the moving image decoding method
explained in the above embodiments. The cell phone ex115 has: an
antenna ex201 for exchanging radio waves with the base station
ex110; a camera unit ex203 such as a CCD camera capable of shooting
moving and still images; a display unit ex202 such as a liquid
crystal display for displaying the data obtained by decoding video
shot by the camera unit ex203, video received by the antenna ex201,
or the like; a main body including a set of operation keys ex204; a
voice output unit ex208 such as a speaker for outputting voices; a
voice input unit ex205 such as a microphone for inputting voices; a
recording medium ex207 for storing encoded or decoded data, such as
data of moving or still images shot by the camera, and data of
text, moving images or still images of received e-mails; and a slot
unit ex206 for attaching the recording medium ex207 to the cell
phone ex115. The recording medium ex207 includes a flash memory
element, a kind of Electrically Erasable and Programmable Read Only
Memory (EEPROM) that is an electrically rewritable and erasable
nonvolatile memory, in a plastic case such as an SD card.

Further, the cell phone ex115 is explained with reference to
FIG. 16. In the cell phone ex115, a power supply circuit unit
ex310, an operation input control unit ex304, an image encoding
unit ex312, a camera interface unit ex303, a Liquid Crystal Display
(LCD) control unit ex302, an image decoding unit ex309, a
multiplex/demultiplex unit ex308, a record/reproduce unit ex307, a
modem circuit unit ex306 and a voice processing unit ex305 are
connected to a main control unit ex311, and to each other, via a
synchronous bus ex313. The main control unit ex311 performs overall
control of each unit of the main body including the display unit
ex202 and the operation keys ex204.

When a call-end key or a power key is turned ON by a user's
operation, the power supply circuit unit ex310 supplies the
respective units with power from a battery pack so as to activate
the camera-equipped digital cell phone ex115 to a ready state.

In the cell phone ex115, under the control of the main control
unit ex311 including a CPU, ROM, RAM and the like, the voice
processing unit ex305 converts the voice signals received by the
voice input unit ex205 in conversation mode into digital voice
data, the modem circuit unit ex306 performs spread spectrum
processing of the digital voice data, and the communication circuit
unit ex301 performs digital-to-analog conversion and frequency
transformation of the data, so as to transmit the result via the
antenna ex201. Also, in the cell phone ex115, the communication
circuit unit ex301 amplifies the data received by the antenna ex201
in conversation mode and performs frequency transformation and
analog-to-digital conversion; the modem circuit unit ex306 performs
inverse spread spectrum processing of the data; and the voice
processing unit ex305 converts it into analog voice data, so as to
output the result via the voice output unit ex208.

Furthermore, when transmitting an e-mail in data communication
mode, the text data of the e-mail inputted by operating the
operation keys ex204 of the main body is sent out to the main
control unit ex311 via the operation input control unit ex304. In
the main control unit ex311, after the modem circuit unit ex306
performs spread spectrum processing of the text data and the
communication circuit unit ex301 performs digital-to-analog
conversion and frequency transformation of the text data, the data
is transmitted to the base station ex110 via the antenna ex201.

When image data is transmitted in data communication mode,
the image data shot by the camera unit ex203 is provided to the
image encoding unit ex312 via the camera interface unit ex303.
When the image data is not transmitted, the image data shot by the
camera unit ex203 can also be displayed directly on the display unit
ex202 via the camera interface unit ex303 and the LCD control unit
ex302.

The image encoding unit ex312, including the moving image
encoding apparatus described in the present invention, compresses
and encodes the image data provided from the camera unit ex203 by
the encoding method used for the moving image encoding apparatus
as shown in the above-mentioned embodiments so as to transform it
into encoded image data, and sends it out to the
multiplex/demultiplex unit ex308. At this time, the cell phone
ex115 sends out the voices received by the voice input unit ex205
during the shooting by the camera unit ex203 to the
multiplex/demultiplex unit ex308 as digital voice data via the voice
processing unit ex305.

The multiplex/demultiplex unit ex308 multiplexes the
encoded image data provided from the image encoding unit ex312
and the voice data provided from the voice processing unit ex305
using a predetermined method. The modem circuit unit ex306
then performs spread spectrum processing of the multiplexed data
obtained as the result, and the communication circuit unit ex301
performs digital-to-analog conversion and frequency transformation
on the result for transmission via the antenna ex201.

When receiving data of a moving image file which is linked to
a website or the like in data communication mode, the modem
circuit unit ex306 performs inverse spread spectrum processing of
the data received from the base station ex110 via the antenna
ex201, and sends out the multiplexed data obtained as the result to
the multiplex/demultiplex unit ex308.

In order to decode the multiplexed data received via the
antenna ex201, the multiplex/demultiplex unit ex308 demultiplexes
the multiplexed data into an encoded bit stream of image data and
an encoded bit stream of voice data, and provides the encoded
image data to the image decoding unit ex309 and the voice data to
the voice processing unit ex305, respectively, via the synchronous
bus ex313.
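
As a non-limiting illustration, the multiplexing and demultiplexing
performed by the multiplex/demultiplex unit ex308 may be sketched as
follows. The specification states only that a predetermined method is
used; the packet layout assumed here (a 1-byte stream tag followed by
a 2-byte big-endian payload length), the tags VIDEO and AUDIO, and the
function names are hypothetical.

```python
import struct
from typing import Iterable, Tuple

# Hypothetical stream tags (assumption, not from the specification).
VIDEO, AUDIO = 0, 1
HEADER = ">BH"                      # 1-byte tag, 2-byte big-endian length
HEADER_SIZE = struct.calcsize(HEADER)


def multiplex(packets: Iterable[Tuple[int, bytes]]) -> bytes:
    """Interleave tagged payloads into a single byte stream."""
    out = bytearray()
    for stream_id, payload in packets:
        out += struct.pack(HEADER, stream_id, len(payload))
        out += payload
    return bytes(out)


def demultiplex(data: bytes) -> Tuple[bytes, bytes]:
    """Split the byte stream back into (image bit stream, voice bit stream)."""
    streams = {VIDEO: bytearray(), AUDIO: bytearray()}
    pos = 0
    while pos < len(data):
        stream_id, length = struct.unpack_from(HEADER, data, pos)
        pos += HEADER_SIZE
        streams[stream_id] += data[pos:pos + length]
        pos += length
    return bytes(streams[VIDEO]), bytes(streams[AUDIO])


muxed = multiplex([(VIDEO, b"v0"), (AUDIO, b"a0"), (VIDEO, b"v1")])
image_stream, voice_stream = demultiplex(muxed)
```

In this sketch, image_stream would be handed to the image decoding
unit ex309 and voice_stream to the voice processing unit ex305.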

Next, the image decoding unit ex309, including the moving
image decoding apparatus described in the present invention,
decodes the encoded bit stream of the image data using the
decoding method corresponding to the encoding method as shown in
the above-mentioned embodiments, so as to generate reproduced
moving image data, and provides this data to the display unit ex202
via the LCD control unit ex302. Thus, moving image data
included in a moving image file linked to a website, for instance, is
displayed. At the same time, the voice processing unit ex305
converts the voice data into analog voice data, and provides this
data to the voice output unit ex208. Thus, voice data included in
the moving image file linked to a website, for instance, is
reproduced.

The present invention is not limited to the above-mentioned
system. Satellite and ground-based digital broadcasting has recently
attracted attention, and at least either the moving image encoding
apparatus or the moving image decoding apparatus in the
above-mentioned embodiments can be incorporated into a digital
broadcasting system as shown in FIG. 17. More specifically, an
encoded bit stream of video information is transmitted from a
broadcast station ex409 to a communication or broadcast satellite
ex410 via radio waves. Upon receipt of it, the broadcast satellite
ex410 transmits radio waves for broadcasting, a home antenna
ex406 with a satellite broadcast reception function receives the
radio waves, and an apparatus such as a television (receiver) ex401
or a Set Top Box (STB) ex407 decodes the encoded bit stream for
reconstruction. The moving image decoding apparatus as shown in
the above-mentioned embodiments can be implemented in a
reproduction device ex403 for reading and decoding an encoded bit
stream recorded on a storage medium ex402, that is, a recording
medium such as a CD or a DVD. In this case, the reproduced video
signals are displayed on a monitor ex404. It is also conceivable to
implement the moving image decoding apparatus in the set top box
ex407 connected to a cable ex405 for cable television or the antenna
ex406 for satellite and/or ground-based broadcasting so as to
reproduce the video signals on a monitor ex408 of the television. The
moving image decoding apparatus may be incorporated into the
television, not in the set top box. Also, a car ex412 having an
antenna ex411 can receive signals from the satellite ex410, the base
station ex107 or the like, and reproduce moving images on a display
device such as a car navigation system ex413 in the car ex412.

Furthermore, the moving image encoding apparatus as shown
in the above-mentioned embodiments can encode image signals and
record them on a recording medium. As a concrete example, there
is a recorder ex420 such as a DVD recorder for recording image
signals on a DVD disk ex421 and a disk recorder for recording them
on a hard disk. They can also be recorded on an SD card ex422. If
the recorder ex420 includes the moving image decoding apparatus
as shown in the above-mentioned embodiments, the image signals
recorded on the DVD disk ex421 or the SD card ex422 can be
reproduced for display on the monitor ex408.

As for the configuration of the car navigation system ex413,
a configuration without the camera unit ex203, the camera
interface unit ex303 and the image encoding unit ex312, out of the
units shown in FIG. 16, is conceivable. The same applies to the
computer ex111, the television (receiver) ex401 and others.

Moreover, three types of implementations can be conceived
for a terminal such as the above-mentioned cell phone ex114: a
communication terminal equipped with both an encoder and a
decoder, a sending terminal equipped with an encoder only, and a
receiving terminal equipped with a decoder only.

As described above, it is possible to use the moving image
encoding method or the moving image decoding method described
in the above embodiments in any of the above-mentioned devices and
systems, and thus the effects explained in the above embodiments
can be obtained.

Also, the present invention is not limited to the above-mentioned
embodiments, and various modifications and
corrections can be made within the scope of the present invention.
Further, each functional block shown in the block diagrams
FIG. 1 and FIG. 2 is typically realized as an LSI, which is an
integrated circuit. This LSI may be constructed in one chip or in a
plurality of chips (for example, the functional blocks other than
memory may be constructed in one chip). Here, it is called an LSI.
However, it may also be called an IC, a system LSI, a super LSI, or
an ultra LSI depending on the integration density.

In addition, the method of constructing an integrated circuit is
not limited to the LSI, and it may be realized as a dedicated circuit
or as a general-purpose processor. A Field Programmable Gate Array
(FPGA) that can be programmed after the LSI is manufactured, or a
reconfigurable processor that can reconfigure the connections and
settings of circuit cells in the LSI, may also be used.

Furthermore, if a technology for constructing an integrated
circuit that replaces the LSI is introduced through the development
of semiconductor technology or another technology derived from it,
the functional blocks may of course be integrated using
the new technology. Application of biotechnology
and the like is one possibility.

Industrial Applicability

As described above, the moving image encoding method
and the moving image decoding method according to the present
invention are useful as methods for encoding each picture which
makes up an inputted image, outputting the result as moving image
encoded data, and decoding the moving image encoded data, using, for
example, a cell phone, a DVD apparatus, a personal computer and
the like.